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METHODS FOR SYSTEMATIC IDENTIFICATION OF PROTEIN - PROTEIN 

INTERACTIONS 

FIELD OF THE INVENTION 

The present invention relates to a method for identifying protein-protein interactions. 
The interacting proteins are isolated by affinity chromatography and identified by mass 
5 spectrometry. The affinity chromatography method allows for the purification of interacting 
proteins which are identified using mass spectrometry techniques. The invention provides a 
method for the high throughput analysis of protein-protein interactions that lends itself to 
yD automation. 

S BACKGROUND OF THE INVENTION 

y. The genome sequencing projects are providing vast amounts of information. With the 

^ whole genome of many organisms, including humans, complete or nearing completion, the next 
□15 challenge involves the characterization of the gene products. Surprisingly, little is known about 
5 the functions of most proteins that the genes encode, or how these proteins interact to control 
cellular functions. 

Protein interactions are intrinsic to virtually every cellular process. Most proteins in cells 
20 function in multi-subunit complexes of proteins created by specific protein-protein interactions. 
Many of the protein-protein interactions involved in cellular processes are too weak to allow co- 
purification of the interacting species by conventional methods from cellular extracts. The 
relatively weak binding is generally expected as proteins that must reversibly interact with each 
other in the concentrated intracellular environment will rapidly dissociate in a comparatively 
25 protein mixture. As the characterization of protein-protein interactions requires the in vitro 
reassembly of multi-subunit protein complexes, it is important to have methods for identifying 
and purifying all of the interacting proteins starting with one member of a protein complex. 



1 



The method of the invention uses a form of protein-affinity chromatography for the 
detection of protein-protein interactions. This method has been used in a non-systematic manner 
and has distinct advantages over other methods for the detection of protein-protein interactions, 
such as the two-hybrid method and co-immunoprecipitation. The two-hybrid system consists of 
5 two components, a target protein (the "bait"), fused to a DNA binding domain which binds to a 
specific region of DNA upstream of a reporter gene, and a protein (the "prey") fused to an 
activation domain which, when brought in close proximity of the reporter gene, can initiate 
transcription. Usually the "bait" protein is known and the "prey" protein is derived from 
genomic or cDNA libraries in order to isolate the interacting partner to the bait. The advantage 
10 of the two-hybrid system is that when an interactor is found the gene sequence can be determined 
directly. This advantage is becoming increasingly less important as the full genomic sequence of 
many organisms becomes available, making the identification of gene sequence from protein 
sequence routine. The two-hybrid system yields a very high percentage of false positives, is very 

5 labor intensive and does not easily lend itself to automation, making it a poor choice for high 

%J15 throughput analysis. 

ffl Protein-protein interactions have commonly been detected by antibody co- 

immunoprecipitation. Co-immunoprecipitation depends on the strength of a secondary protein- 
8 protein interaction, rather than on direct binding to the antibody. The technique is normally 
u20 limited to relatively strong interactions with > 10" 9 M. Additionally, it is not as sensitive as 
W protein-affinity chromatography, because the concentration of the antigen is low. 

O Protein-affinity chromatography offers distinct advantages as a technique for detecting 

protein-protein interactions. Protein affinity chromatography allows sensitive detection of 

25 protein-protein interactions. This method can detect interactions ranging in strength from K d 10 -5 
to 10 10 M. This limit is within the range of the weakest interactions likely to be physiologically 
relevant, which is estimated to be about 10- 3 M. Formosa et al., Methods in Enrymology 1991, 
208, 24-45 . An interacting protein with a > 1 0" 5 M may not remain bound to the column 
when the column is washed with buffer in order to lower the nonspecific binding of proteins 

30 from the extract to the column material. 

Protein-affinity chromatography tests all proteins in an extract equally for binding to the 
ligand protein. Thus, extract proteins that are detected have successfully competed for the 
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interaction with the ligand protein against the rest of the population of proteins in the extract. 
Additionally, interactions that are dependent on a multi-subunit complex, including the ligand 
protein and multiple extract proteins and/or cofactors, can be detected. Both the domains of a 
protein and critical residues within the protein responsible for a specific interaction can be 
5 examined for affinity to extract proteins by the use of mutant derivatives of the ligand protein. 

The method of the invention allows for the isolation of specific protein interactors. The 
interacting proteins are identified by protease digestion followed by mass spectrometry. During 
the past decade, new techniques in mass spectrometry have made it possible to accurately 
1 0 measure with high sensitivity the molecular weight of peptides and intact proteins. These 
techniques have made it much easier to obtain accurate peptide masses of a protein for use in 
databases searches. Mass spectrometry provides a method of protein identification that is both 
very sensitive (10 fmol - 1 pmol) and very rapid when used in conjunction with sequence 
5 databases. Advances in protein and DNA sequencing technology are resulting in an exponential 
Ql 5 increase in the number of protein sequences available in databases. As the size of DNA and 
Pi protein sequence databases grows, protein identification by correlative peptide mass matching 
ffl has become an increasingly powerful method to identify and characterize proteins. 

s Historically, the explosion in gene sequence information has far outpaced the 

SO characterization of gene products. The processes of isolation and identification of protein 
W interactors have represented a bottleneck in the characterization of protein-protein interactions. 
S Current methods for the isolation and identification of protein interactors are performed on a 
Q protein-by-protein basis with relatively low throughput. The method of the invention provides a 

process for the high throughput analysis of protein-protein interactions. The use of micro- 
25 columns and mass spectroscopy provide the basis for using high throughput methods. The use of 

multiple ligand concentrations provides the binding curves and assures the reliability of the 

interactions that are identified. 

30 SUMMARY OF THE INVENTION 

The method of the invention provides a process for the identification of interacting 
proteins that is suitable for high throughput analysis and amenable to automation. 
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The identification of protein interactions is performed using affinity chromatography 
followed by mass spectrometric analysis. Cellular extract or extracellular fluid is loaded onto 
multiple experimental micro-columns, those with bound ligand protein, and a control micro- 
column with no bound ligand protein. Each of the experimental micro-columns contains a 

5 different concentration of ligand bound to the matrix support. A fixed volume of cellular extract 
is chromatographed through each micro-column. Only affinity chromatography buffer (ACB) is 
chromatographed on a second control micro-column which contains the highest concentration of 
ligand bound (coupled) to the matrix support. The components of the eluate are separated, for 
example, on the basis of apparent molecular weight using SDS-PAGE, and visualized, for 

1 0 example, by protein staining. The interacting protein is observed to vary in amount in direct 
relation to the concentration of coupled protein ligand. The bands of interest are excised from 
the gel and analyzed using mass spectrometric techniques. 

SJ15 BRIEF DESCRIPTION OF THE DRAWINGS 

W Figure 1 is a SDS-polyacrylamide gel run with the salt and SDS eluates from the affinity column 
J using the S. aureus protein SA0005 as the ligand. The interacting protein is easily discerned 

from the background non-specific binding proteins as the band intensity increases with the 
EI20 increasing ligand concentration, but does not occur in the no-ligand and ACB controls. 

n Figure 2 is the mass spectrum of the tryptic peptides of the interacting protein excised from the 
Q gel of Figure 1 . The technique used to obtain the spectrum is MALDI-TOF mass spectrometry. 
The peptide masses were used to identify the interacting protein as a truncated form of EF-Tu. 

25 

Figure 3 is a SDS-polyacrylamide gel run with the salt and SDS eluates from the affinity column 
using the S. aureus protein SA0146 as the ligand. The interacting protein is easily discerned 
from the background non-specific binding proteins as the band intensity increases with the 
increasing ligand concentration, but does not occur in the no-ligand and ABC controls. 

30 

Figure 4 is the mass spectrum of the tryptic peptides of the interacting protein excised from the 
gel of Figure 3. The technique used to obtain the spectrum is MALDI-TOF mass spectrometry. 
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The peptide masses were used to identify the interacting protein as a concerved hypothetical 
protein of unknown faction. 

Figure 5 is a SDS-polyacrylamide gel run with the salt and SDS eluates from the affinity column 
5 using the S. aureus protein SA0203 as the ligand. The interacting protein is easily discerned 
from the background non-specific binding proteins as the band intensity increases with the 
increasing ligand concentration, but does not occur in the no-ligand and ACB controls. 

Figure 6 is the mass spectrum of the tryptic peptides of the interacting protein excised from the 
10 gel of Figure 5. The technique used to obtain the spectrum is MALDI-TOF mass spectrometry. 
The peptide masses were used to identify the interacting protein as a homologue of peptide chain 
release factor 3. 

S Figure 7 is a SDS-polyacrylamide gel run with the salt and SDS eluates from the affinity column 
Nl 5 using the S. aureus protein S A0276 as the ligand. The interacting proteins are easily discerned 
L 4 ! from the background non-specific binding proteins as the band intensities increases with the 
K increasing ligand concentration, but do not occur in the no-ligand and ACB controls. 

Figure 8 is the mass spectra of the tryptic peptides of the interacting proteins, interactor 1 and 
y20 interactor 2, excised from the gel of Figure 7. The technique used to obtain the spectrum is 
y MALDI-TOF mass spectrometry. The peptide masses from the respective spectra were used to 
O identify the interacting proteins as homologues of glutamyl-tRNA Gin amidotransferase subunits 
— A and B. 

25 Figure 9 is a SDS-polyacrylamide gel run with the salt and SDS eluates from the affinity column 
using the S. aureus protein SA0526 as the ligand. The interacting protein is easily discerned 
from the background non-specific binding protein as the band intensity increases with the 
increasing ligand concentration, but does not occur in the no-ligand and ACB control. 

30 Figures 10 is the mass spectra of the tryptic peptides of the interacting protein excised from the 
gel of Figure 9. The technique used to obtain the spectrum is MALDI-TOF mass spectrometry. 
The peptide masses were used to identify the interacting proteins as a homologue of EF-Tu. 
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Figure 1 1 is a polyacrylamide gel run with SDS eluates from the affinity column using the S. 
aureus protein SA0808 as the ligand. The interacting proteins are easily discerned from the 
background non-specific binding protein as the band intensity increases with the increasing 
ligand concentration, but does not occur in the no-ligand and ACB controls. 

5 

Figures 12a and 12b are the mass spectra of the tryptic peptides of the interacting proteins, 
interactor 1, interactor 2 (Figuel2a), interactor 3 and interactor 4 (figure 12b), excised from the 
gel of Figure 11. The technique used to obtain the spectra is MALDI-TOF mass spectrometry. 
The peptide masses from the respective spectra were used to identify the interacting proteins as 
10 homologues of elongation factor G, trigger factor (prolyl isomerase), formate-tetrahydrofolate 
ligase, and EF-Tu. 

Figure 13 is a polyacrylamide gel run with SDS eluates from the affinity column using the S. 
§ aureus protein SA0989 as the ligand. The interacting proteins are easily discerned from the 
Ml 5 background non-specific binding protein as the band intensity increases with the increasing 
IjJ ligand concentration, but does not occur in the no-ligand and ACB controls. 

m Figure 14 is the mass spectra of the tryptic peptides of the interacting proteins, interactor 1 and 

interactor 3, excised from the gel of Figure 13. The technique used to obtain the spectra is 
[l20 MALDI-TOF mass spectrmetry. The peptide masses from the respective spectra were used to 
W identify two of the interacting proteins as homologues of trigger factor (prolyl isomerase) and 
Q enolase. The third is unidentified. 

Figure 15 is a polyacrylamide gel run with SDS eluates from the affinity column using the 
25 unknown S. aureus protein SA1 094 as the ligand. The interacting protein is easily discerned 
from the background non-specific binding protein as the band intensity increases with the 
increasing ligand concentration, but does not occur in the no ligand ACB controls. 

Figure 16 is the mass spectrum of the tryptic peptides of the interacting protein excised from the 
30 gel of Figure 15. The technique used to obtain the spectrum is MALDI-TOF mass spectrometry. 
The peptide masses were used to identify the interacting protein as as a homologue of a putative 
peptidase. 
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Figure 17 is a polyacrylamide gel run with SDS eluates from the affinity column using the S. 
aureus protein SA1 1 85 as the ligand. The interacting proteins are easily discerned from the 
background non-specific binding protein as the band intensity increases with the increasing 
ligand concentration, but does not occur in the no-ligand and ACB controls. 

5 

Figure 18 is the mass spectra of the tryptic peptides of the interacting proteins, interactor 1 and 
interactor 2, excised from the gel of Figure 17. The technique used to obtain the spectra is 
MALDI-TOF mass spectrometry. The peptide masses from the respective spectra were used to 
identify the interacting proteins as homologues of glucose-6-phosphate isomerase and cysteine 
10 synthetase. 

Figure 19 is a polyacrylamide gel run with SDS eluates from the affinity column using the S. 

aureus protein SA1203 as the ligand. The interacting protein is easily discerned from the 
5 background non-specific binding protein as the band intensity increases with the increasing 
Ql 5 ligand concentration, but does not occur in the no-ligand and ACB controls. 

W Figure 20 is the mass spectrum of the tryptic peptides of the interacting protein excised from the 

J gel of Figure 19. The technique used to obtain the spectrum is MALDI-TOF mass spectrometry. 

* The peptide masses were used to identify the interacting protein as a homologue of NADH 

L20 dehydrogenase. 

O DETAILED DESCRIPTION 
25 Protein ligand 

The term ligand refers to a protein to be immobilized on the column support (matrix). 
The possible protein ligands include naturally occurring proteins, modified proteins, synthetic 
proteins and subdomains or fragments of proteins. Most typically, the protein to be used as a 
ligand is used in a heterologous system. Ideally, the protein to be used as the ligand should be 
30 pure. Preferably, the ligand is at least 90% pure. This ensures that the interacting proteins that 
are detected are binding to the intended ligand rather than a contaminant. The preferred method 
of obtaining protein, if the gene is available, is through the use of fusion proteins. If, for 
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technical reasons, an impure ligand must be used, it is important to use a control preparation that 
mimics the contaminants but does not contain ligand. 

The ligand protein to be used for affinity chromatography can be encoded by the DNA of 
a virus or any living organism. The DNA fragment to be cloned is typically identified from the 
gene sequence when the genome of the organism is partly or entirely known. Isolation of the 
DNA fragment is performed, for example, by gel electrophoresis after digestion with a DNA 
restriction enzyme, by random fragmentation or by amplification from genomic DNA or a 
recombinant clone by using the polymerase chain reaction (PCR). 

DNA encoding the protein or protein fragment is cloned into an expression vector. The 
wide availability of recombinant DNA technology makes it feasible to generate expression 
systems that can produce sufficient quantities of a selected protein for use as a ligand in the 
method of the invention. The steps for protein production include: generation of the protein 
expression systems, over-expressing the protein and purifying the protein. The generation of a 
clone for any particular gene of interest, and its incorporation into a suitable expression vector, is 
now a straightforward task that can be done in a parallel fashion for high throughput production. 
Edwards et al., Nature Structural Biology 2000, 7, 970-972. The selection of target proteins from 
completely sequenced genomes can take advantage of the availability of these cloned genes. 
However, even if a clone of a particular protein of interest is not readily available, it has now 
become a routine operation to generate a cDNA clone for almost any particular protein from a 
wide variety of organisms. 

To obtain expression of a cloned nucleic acid, the expression vector for expression in 
bacteria contains a strong promoter to direct transcription, a transcription/translation terminator, 
and if the nucleic acid encodes a peptide or polypeptide, a ribosome binding site for translational 
initiation. Suitable bacterial promoters are well known in the art and described, e.g., in 
Sambrook et al. and Ausubel et al. Bacterial expression systems are available in, e.g., E. coli, 
Bacillus sp., and Salmonella (Palva et al., Gene 22:229-235 (1983); Mosbach et al., Nature 
302:543-545 (1983). Kits for such expression systems are commercially available. 

Post-translational modification of the ligand protein may be related to the protein's ability 
to interact with other proteins. In certain cases eukaryotic expression systems are preferred, 
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where post-translational modifications are important, for example, glycosylation. Eukaryotic 
expression systems for mammalian cells, yeast, and insect cells are well known in the art and are 
also commercially available. In some cases, it may be preferable to employ expression vectors 
which can be propagated in both prokaryotic and eukaryotic cells, enabling, for example, nucleic 
5 acid purification and analysis using one organism and protein expression using another. 

Transfection methods used to produce bacterial, mammalian, yeast or insect cells or cell 
lines that express large quantities of protein are well known in the art. These include the use of 
calcium phosphate transfection, polybrene, protoplast fusion, electroporation, liposomes, 
1 0 microinjection, viral vectors and any of the other well known methods for introducing cloned 
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, e.g., 
Sambrook et al., supra). After the expression vector is introduced into the cells, the transfected 
cells are cultured under conditions favoring expression of protein, which are then purified using 
= standard techniques. 

IH The protein is expressed in suitable amounts for use as the ligand. There are several 

ffl expression systems that have been extensively studied. Some of these include: 1) bacterial (E. 
m coli), 2) methylotrophic yeast (Pichia pastorisis), 3) viral (baculovirus, adenovirus, vaccinia and 
s some RNA viruses), 4) cell culture (mammalian and insect), and 5) in vitro translation. 
U 20 Although the expression of any particular protein may be idiosyncratic, the availability of these 
W and other expression systems significantly increases the ability to produce large quantities of 
p protein. 

In situations in which relatively large amounts of relatively pure protein in native form 
25 are required, it will be desirable to employ expression systems characterized by high expression 
levels and efficient protein processing, including cleavage of signal peptides and other 
post-translational modifications. The baculovirus expression system is widely used to express a 
variety of proteins in large quantities. In addition to fulfilling the above requirements, the size of 
the expressed protein is not limited, and expressed proteins are typically correctly folded and in a 
30 biologically active state. Baculovirus expression vectors and expression systems are 
commercially available (Clontech, Palo Alto, CA; Invitrogen Corp., Carlsbad, CA). 
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Once a protein has been expressed to an acceptable level, the protein is purified from the 
other contents of the cell system that was utilized for expression. Highly purified protein is often 
desirable for further analysis according to the method of the invention. The proteins can be 
expressed fused to tags that aid subsequent purification or measurement techniques. Typical tags 
5 bind specifically to particular affinity matrices, allowing the attached protein to be purified 
without regard to its physical or biochemical characteristics. Such tags can then be cleaved, 
leaving the protein in its native form. Examples of tags include histidine rich sequences which 
bind to various metal ions, glutathione-S-transferase (GST) tags which selectively bind to 
glutathione, maltose-binding protein, or an epitope for an available monoclonal antibody. The 
1 0 recombinant protein to be used as a ligand is purified from the cells of the heterologous system 
by a chromatographic procedure that makes use of the tag on the protein. Examples of such 
procedures include, but are not limited to, nickel chelate chromatography, chromatography on a 
glutathione column, or chromatography on a suitable antibody column. In certain cases, the 
S fusion protein also includes a cleavable sequence of amino acids between the protein of interest 
,: Jl 5 and the tag sequence whereby the tag can be cleaved from the protein of interest. Typically, this 
Sjf is accomplished with a protease that cleaves the sequence under conditions where the protein of 
W interest is not degraded, or with an intein sequence, which allows for internal cleavage of the 
m protein. Alternatively, the tags provide a method for specifically anchoring proteins to a column 
; support. 

W Alternatively, the ligand protein is purified by other acceptable methods known in the art, 

Q for example by immuno-chromatographic methods. Specific antibodies that recognize the ligand 
O protein are generated in a number of organisms using ligand protein, or a portion of it. The 

antibodies are linked to a column support and used to purify the ligand protein from a cellular 
25 extract or other source. 

The ligand protein is coupled directly (through a covalent linkage) to commercially 
available pre-activated resins as described in Formosa et al., Methods in Enzymology 1991, 208, 
24-45; Sopta et al, J. Biol. Chem. 1985, 260, 10353-60; Archambault et al., Proc. Natl. Acad. 
30 Sci. USA 1997, 94, 14300-5. Alternatively, the ligand protein is tethered to the column support 
through high affinity binding interactions. If the ligand is expressed fused to a tag, such as GST, 
the fusion tag can be used to anchor the ligand protein to the matrix support, for example 
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Sepharose beads containing immobilized glutathione. Column supports that take advantage 
these tags are commercially available. 



It is preferred that the ligand protein be coupled to the column matrix by a covalent 
5 linkage. The coupling procedures typically make use of the many primary amino groups (lysines 
and the amino-terminal residues) on the surface of the protein. Any coupling chemistry which 
makes use of primary amines is appropriate, but a reactive chemical moiety should be used 
which reacts at a reasonable rate at the physiological pH which is most appropriate for the ligand 
and the extracts to be used in the procedure (e.g., N-hydroxy-succinimide works well at pH 7.5- 
10 8.0). Commercially available column supports have reactive moieties for coupling to proteins, 
for example, cyanogen bromide-activated Sepharose (Pharmacia) or N-hydroxysuccinimide- 
activated agarose matrix, available as Affi-Gel 10 (Bio-Rad). 

Failure to detect an interacting protein can result from inactivation of the ligand protein 
1 5 during coupling to the column support. Ideally, each ligand protein is randomly tethered to the 
matrix through one covalent bond. When the ligand is attached randomly, some of the 
immobilized protein molecules will always be oriented in such a way as to be able to interact 
with the proteins in the extract. 

20 The ligand protein is contacted with the matrix under conditions that are favorable for 

coupling. The solid support beads are mixed and shaken gently, tumbled or rotated with solution 
containing the protein ligand. Alternatively, the protein ligand solution is reacted with the 
activated solid support which is already packed into a column. The latter method, using a pre- 
packed column, is preferred as it typically uses less ligand and is amenable to automation and 

25 high throughput analysis. The concentration of salt and the pH are adjusted to be appropriate for 
the resin and the protein ligand that are being used. 

To achieve optimal sensitivity, it is important to choose a matrix that will couple a 
maximum concentration of protein without introducing potentially denaturing multiple cross- 
30 links to individual proteins. A matrix is chosen to minimize the non-specific interactions 
between proteins from the extract and the matrix. The matrix support is chosen from, for 
example, agarose, sepharose, glass beads, latex beads, cellulose, or dextran, To detect 
interactions efficiently, the concentration of the ligand protein bound to the matrix should be at 
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least 10-fold higher than the K d of the interaction. Thus, the concentration of the ligand protein 
bound to the matrix should be high for the detection of weak protein-protein interactions. 

The coupling is done at various ratios of the protein ligand to the resin, with the upper 
5 limit of the protein : resin ratio determined by the isoelectric point and the ionic nature of the 
protein. The use of a concentration series for the protein ligand allows one to obtain an estimate 
for the strength of the protein-protein interaction that is observed in the affinity chromatography 
experiment. A binding curve which has the proper shape indicates that the interaction that is 
observed is biologically important rather than a spurious interaction with denatured protein. 

10 

A series of columns is prepared with varying concentrations of protein ligand (mg protein 
ligand/ml resin volume). The number of columns employed is between 2 to 15, each with a 
different concentration of attached ligand. Preferably, 4 to 6 columns are prepared with varying 
concentrations of ligand. In addition, two control columns are generally prepared: one that 

1 5 contains no ligand and a second that contains the highest concentration of ligand but is not 
treated with extract. After elution of the columns and electrophoretic separation of the eluent 
components, the interacting proteins can be distinguished from the non-specific bound proteins. 
The concentration of the interacting proteins, as determined by the intensity of the band on the 
gel, will increase proportionally to the increase in protein ligand concentration but will be 

20 missing from the second control column. This allows for the identification of unknown 
interacting proteins. 

The reaction of the protein with the solid support can be terminated, if desired, but not 
necessarily, by reacting the support with ethanolamine. It has been standard practice to treat the 
25 column support resin with ethanolamine and bovine serum albumin (BSA) after the ligand 

protein is coupled. This was done to block the remaining reactive groups on the resin. We have 
found that it is preferable to avoid the treatment of the resin with BSA and ethanolamine. By 
omitting this treatment, we have found that the non-specific binding of proteins from an extract 
to the resin is reduced by about five-fold. 

30 

Micro-columns 

The method of the invention is ideal for small-scale analysis. A variety of column sizes, 
types, and geometries can be used. For high throughput analysis, it is advantageous to use small 
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volumes, about 20-100 pi. The column can be constructed in a glass capillary with a drawn-out 
tip or a plastic pipette tip. In order to retain the solid support in the capillary or pipette tip, the tip 
is blocked with glass beads, glass wool, filter paper, or a frit. The entire affinity chromatography 
procedure can be automated by assembling the micro-columns into an array format (e.g. with 96 
5 micro-column arrays or any other device containing multiple micro-columns). 

Preparation of Cell Extract 

The extract contains a mixture of proteins derived from a natural source. The extract may 
be a cellular extract or extracellular fluid. The choice of starting material for the extract is based 
1 0 upon the cell or tissue type or type of fluid that would be expected to contain proteins that 

interact with the target protein. Micro-organisms or other organisms are grown in a medium that 
is appropriate for that organism and can be grown in specific conditions to promote the 
expression of proteins that may interact with the target protein. 

1 5 The starting material used to make the extract is: 1) one or more types of tissue derived 

from an animal, plant, or other multi-cellular organism, 2) cells grown in tissue culture that were 
derived from an animal or human, plant or other source, 3) micro-organisms grown in suspension 
or non-suspension cultures, 4) virus-infected cells, 5) purified organelles (including, but not 
restricted to nuclei, mitochondria, membranes, Golgi, endoplasmic reticulum, lysosomes, or 

20 peroxisomes) prepared by differential centrifugation or another procedure from animal, plant or 
other kinds of eukaryotic cells, 6) serum or other bodily fluids including, but not limited to, 
blood, urine, semen, synovial fluid, cerebrospinal fluid, amniotic fluid, lymphatic fluid or 
interstitial fluid. 

25 Whole cell extracts are generally used as the source of interacting proteins. In some 

cases, a total cell extract may not be the optimal source of interacting proteins. For example, if 
the ligand is known to act in the nucleus, a nuclear extract can provide a 10-fold enrichment of 
proteins that are likely to interact with the ligand. In addition, proteins that are present in the 
extract in low concentrations can be enriched using another chromatographic method to 

30 fractionate the extract before screening various pools for an interacting protein. 

The cells are lysed by standard methods, including, but not limited to enzymatic lysis, 
grinding with alumina or another abrasive, use of a French pressure cell, sonication, treatment 
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with detergent, beating with glass beads in a bead beater or blender, cryogenic grinding, exposure 
to differential osmotic pressure, use of a mill, or use of a Dounce homogenizer. It is 
advantageous to carry out the procedure at a low temperature (e.g., 4°C) in order to retard 
denaturation or degradation of proteins in the extract. The tissue or cells or cell extract is 
5 suspended in a solution containing Tris or Hepes or another biological buffer that is standard in 
the art at a concentration that is adequate to establish the pH of the extract. The pH is adjusted to 
be appropriate for the body fluid or tissue, cellular, or organellar source that is used for the 
procedure (e.g. ph7-8 for cytosolic extracts from mammals, but low pH for lysosomal extracts). 
The concentration of chaotropic or non-chaotropic salts in the extracting solution is adjusted so 
10 as to extract the appropriate sets of proteins for the procedure. Glycerol may be added to the 
lysate, as it aids in maintaining the stability of many proteins and also reduces background non- 
specific binding. Both the lysis buffer and column buffer should contain protease inhibitors to 
minimize proteolytic degredation of proteins in the extract and to protect the ligand. Appropriate 

5; co-factors that could potentially interact with the interacting proteins can be added to the 
5 extracting solution. One or more nucleases or another reagent is added to the extract, if 
appropriate, to prevent protein-protein interactions that are mediated by nucleic acids. 

W Appropriate detergents or other agents are added to the solution, if desired, to extract membrane 

J proteins from the cells or tissue. A reducing agent (e.g. dithiothreitol or 2-mercaptoethanol or 
glutathione or other agent) can be added to extracts derived from cells, but is more often omitted 

u 20 when the source of protein extract is derived from an extracellular source. Trace metals or a 

W chelating agent can be added, if desired, to the extracting solution. 

O The extract is centrifuged in a centrifuge or ultracentrifuge or filtered to provide a 

clarified supernatant solution. This supernatant solution may be dialyzed using dialysis tubing, 

25 or another kind of device that is standard in the art, against a solution that is similar to, but may 
not be identical with, the solution that was used to make the extract. An example of a change in 
the dialysis solution is to adjust the concentrations of salts to the ones that will be used for the 
affinity chromatography procedure. The dialysis procedure can last from less than an hour to 
many hours and can be omitted for fluids derived from extracellular sources or, in some cases, 

30 for extracts derived from intracellular sources. After dialysis, the extract containing naturally 
occurring proteins can be used immediately, stored for a short time, stored for many hours at a 
low temperature or stored in a frozen state at a low temperature (e.g. -80 °C). The extract is 
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clarified by centrifugation or filtration again immediately prior to its use in affinity 
chromatography. 

In some cases, the crude lysate will contain small molecules that can interfere with the 
5 affinity chromatography. This can be remedied by precipitating proteins with ammonium 
sulfate, centrifugation of the precipitate, and re-suspending the proteins in the affinity column 
buffer followed by dialysis. An additional centrifugation of the sample may be needed to remove 
any particulate matter prior to application to the affinity columns. 

1 0 The amount of cell extract applied to the column is important for two opposing reasons. 

If too little extract is applied to the column and the interacting protein is present at low 
concentration, the level of interacting protein retained by the column may be difficult to detect. 
Conversely, if too much extract is applied to the column, protein may precipitate on the column 
or competition by abundant interacting proteins for the limited amount of protein ligand may 

1 5 result in a difficulty in detecting minor species. 

Affinity Chromatography 

The columns are loaded with protein extract from an appropriate source that has been 
dialyzed against a buffer that is consistent with the nature of the expected interaction. Glycerol 

20 is normally included in the buffer. Any standard biological buffer can be used. The pH, salt 
concentrations and the presence or absence of reducing and chelating agents, trace metals, 
detergents, and co-factors may be adjusted according to the nature of the expected interaction. 
Most commonly, the pH and the ionic strength are chosen so as to be close to physiological for 
the source of the extract. The extract is most commonly loaded under gravity onto the columns 

25 at a flow rate of about 4-6 column volumes per hour, but this flow rate can be adjusted for 
particular circumstances in an automated procedure. 

The volume of the extract that is loaded on the columns can be varied but is most 
commonly equivalent to about 5 to 10 column volumes. When large volumes of extract are 
30 loaded on the columns, there is an improvement in the signal-to-noise ratio because more protein 
from the extract is available to bind to the protein ligand, whereas the background binding of 
proteins from the extract to the solid support saturates with low amounts of extract. 



15 



A control column is included that contains the highest concentration of protein ligand, but 
buffer rather than extract is loaded onto this column. The elutions (eluates) from this column 
will contain ligand protein that failed to be attached to the column in a covalent manner, but no 
proteins that are derived from the extract. 

5 

The columns are washed with a buffer appropriate to the nature of the interaction being 
analyzed, usually, but not necessarily, the same as the loading buffer. An elution buffer with an 
appropriate pH, glycerol, and the presence or absence of reducing agent, chelating agent, 
cofactors, and detergents are all important considerations. The columns are washed with about 5 
10 to 20 column volumes of each wash buffer to eliminate unbound proteins from the natural 
extract. The flow rate of the wash is usually adjusted to about 4 to 6 column volumes per hour 
by using gravity or an automated procedure, but other flow rates are possible in specific 
circumstances. 



15 In order to elute the proteins that have been retained by the column, the interactions 

between the extract proteins and the column ligand are disrupted. This is performed by eluting 
the column with a solution of salt or detergent. Retention of activity by the eluted proteins 
normally requires the presence of glycerol and a good buffer of appropriate pH, as well as proper 
choices of ionic strength and the presence or absence of appropriate reducing agent, chelating 

20 agent, trace metals, cofactors, detergents, chaotropic agents, and other reagents. If physical 
identification of the bound proteins is the objective, the elution can be performed sequentially, 
first with buffer of high ionic strength and then with buffer containing a protein denaturant, most 
commonly, but not restricted to sodium dodecyl sulfate (SDS), urea, or guanidine hydrochloride. 
We have found that it is advantageous to simply elute the column with a protein denaturant, 

25 particularly SDS, for example as a 1% SDS solution. Using only the SDS wash, and omitting the 
salt wash results in SDS-gels that have higher resolution (sharper bands with less smearing). 
This makes it easier to visualize specifically bound proteins against the background of non- 
specifically bound proteins. In addition, using only the SDS wash results in half as many 
samples to analyze by electrophoresis. The number of samples to be analyzed is an important 

30 consideration for the development of high throughput techniques. The volume of the eluting 
solution can be varied but is normally about 2 to 4 column volumes. For 20 |il columns, the flow 
rate of the eluting procedures are most commonly about 4 to 6 column volumes per hour, under 
gravity, but can be varied in an automated procedure. 
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Separation of Eluent Components 

The proteins from the extract that were bound to and are eluted from the affinity columns 
can be most easily resolved for identification by an electrophoresis procedure, but this procedure 
can be omitted and one can proceed directly to identification by mass spectrometry. Most 
commonly, it is easiest and most effective to use polyacrylamide gel electrophoresis (PAGE) on 
a slab gel, but any of the denaturing or non-denaturing electrophoresis procedures that are 
standard in the art can be used for this purpose, including gradient gels, capillary electrophoresis, 
and two-dimensional gels with isoelectric focusing in the first dimension and SDS-PAGE in the 
second. Preferably, the individual components in the column eluent are separated by 
polyacrylamide gel electrophoresis. 

Protein bands or spots are visualized using a staining technique such as Coomassie blue 
or silver staining, or some other agent that is standard in the art that does not interfere with 
protein identification by mass spectrometry. Silver staining is preferred as it provides a lower 
detection limit, involves less time for sample preparation and does not lead to protein 
modifications. Alternatively, autoradiography can be used for visualizing proteins isolated from 
organisms cultured on media containing a radioactive label, for example 35 S0 4 2_ or 
35 [S]methionine, that is incorporated into the proteins. Radioactive labeling has the advantage of 
allowing detection and quantitation by scintillation counting of fractions containing binding 
proteins before polyacrylamide gel electrophoresis. Additionally, the use of radioactively 
labelled extract allows a distinction to be made between extract proteins that were retained by the 
column and proteolytic fragments of the ligand that may be released from the column. However, 
radioactive labeling is relatively expensive without being more sensitive than silver staining. 
The radioactive extracts must be handled more carefully, and autoradiography is generally slower 
than silver staining. 

Protein bands that are derived from the extract (i.e. it did not elute from the control 
column that was not loaded with protein from the extract) and bound to an experimental column 
that contained protein ligand covalently attached to the solid support, and did not bind to a 
control column that did not contain any protein ligand, are excised from the stained 
electrophoretic gel with a clean instrument, usually a scalpel, and further processed for mass 
spectrometry. 
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Identification of the protein by mass spectrometry is greatly facilitated if the disulfide 
bonds of the protein are reduced and the free thiols are alkylated after reduction and prior to 
digestion of the protein with protease. The reduction is performed by treatment of the gel slice 
with a reducing agent, for example with dithiothreitol. The protein is alkylated by treating the 
5 gel slice with a suitable alkylating agent, for example iodoacetamide. 

Prior to analysis by mass spectrometry, the protein may be chemically or enzymatically 
digested. The protein sample in the gel slice is subjected to in-gel digestion. Shevchenko A. et 
al., Mass Spectrometric Sequencing of Proteins from Silver Stained Polyacrylamide Gels. 
10 Analytical Chemistry 1996, 58, 850-858. The preferred method of digestion is by treatment with 
the enzyme trypsin. The resulting peptides are extracted from the gel slice into a buffer. 

The peptide fragments may be purified, for example by use of chromatography. A solid 
support that differentially binds the peptides and not the other compounds derived from the gel 
1 5 slice, the protease reaction or the peptide extract may be used. The peptides are eluted from the 
solid support into a small volume of a solution that is compatible with mass spectrometry (e.g. 
50% acetonitrile/0.1% trifluoroacetic acid) 

The preparation of a protein sample from a gel slice that is suitable for mass spectrometry 
20 can also be done by an automated procedure. 

Mass Spectrometry 

Peptide samples derived from gel slices can be analyzed by any one of a variety of 
techniques in mass spectrometry, including, but not limited to matrix-assisted laser desorption 

25 ionization time-of-flight mass spectrometry (MALDI-TOF), triple quadrupole MS using either 
electrospray MS, electrospray tandem MS, nano-electrospray MS, or nano-electrospray tandem 
MS, as well as ion trap or Fourier transform mass spectrometry, or mass spectrometers 
comprised of components from any one of the above mentioned types (e.g. quadrupole-TOF). 
This analysis can be performed with any mass spectrometer that has the capability of measuring 

30 the peptide masses with high mass accuracy, precision, and resolution, as well as the capability 
of measuring the masses of fragments generated from a specific peptide when analyzed under 
conditions that induce dissociation of the peptide. 
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Eluates from the affinity chromatography columns can also be analyzed directly without 
resolution by electrophoretic methods, by proteolytic digestion with a protease in solution, 
followed by applying the proteolytic digestion products to a reverse phase column and eluting 
the peptides from the column directly into a mass spectrometer using an electrospray or nano- 
5 electrospray sample introduction interface. For example, peptides may be eluted directly into an 
ion trap or triple quadrupole mass spectrometer. 

Methods that use a MALDI-TOF instrument are, however, more rapid and preferred for 
high throughput procedures because it takes approximately 30 seconds to analyze a sample by 
10 MALDI-TOF in an automated procedure, whereas it takes approximately one hour to introduce 
samples into the other kinds of instruments via micro-capillary HPLC. 

If MALDI-TOF is used to analyze the peptides from the digested interacting protein, the 
method yields a high accuracy peptide mass spectrum. Patterson, Electrophoresis 1995, 16, 1 104- 

15 14. The peptide masses obtained from MALDI-TOF are used for correlative database searching 
of protein or DNA sequence databases. Yates et al., Anal. Biochem. 1993, 214, 397-408. The 
molecular weights of the peptides are compared with a database of peptides from predicted 
proteins encoded by the organism's genome. This sensitive method is able to characterize 
proteins that are present at very low concentration, as low as sub-picomole levels. This method 

20 is suitable for high throughput identification of proteins, particularly for organisms whose 
genomes are sequenced. 

This method allows the rapid and accurate mapping of peptide mixtures by measuring the 
molecular weight of each component. The peptide mixture is generated by sequence-dependent 

25 cleavage of the polypeptide backbone by proteolytic enzymes or chemical agents. The peptide 
map obtained by specific cleavage or digestion, for example with trypsin, results in a unique 
peptide fingerprint for a given protein. Thus in the case of mass spectrometric mapping, the 
experimental data are a partial or complete set of molecular weights of peptides resulting from 
the cleavage (digestion) of the protein. The peptide masses are searched against both in-house 

30 proprietary and public databases using a correlative mass matching algorithm. Statistical 

analysis is performed upon each protein match to determine the validity of the match. Typical 
constraints include error tolerances within 0.1 Da for monoisotopic peptide masses. Cysteines 
are alkylated and searched as carboxyamidomethyl modifications. Identified proteins are stored 
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automatically in a relational database with software links to SDS-PAGE images and ligand 
sequences. Often, even a partial peptide map is specific enough for identification of the protein. 
If no match is found, a more error-tolerant search can be used, for example using fewer peptides 
or allowing a larger margin for error. In these cases the tentative identity of the interacting 
5 protein should be confirmed by a second method. 

This technique is used to assign function to an unknown protein based upon the known 
function of the interacting protein in the same or a homologous/orthologous organism. Protein- 
protein interactions are stored in a relational database to create an c in-silico' network of protein 
10 interactions with the predicted effect each protein has upon cellular functions. 

The knowledge gained from the relational database is used to select protein targets for 
further analysis including the immobilization of one or more interacting partners on a solid 
support and screening a chemical or drug library for compounds that affect the interaction. The 
1 5 chemicals or drugs are screened for there ability to influence the protein-protein interaction. 

Tandem mass spectrometry or post source decay is used for proteins that cannot be 
identified by peptide-mass matching or to confirm the identity of proteins that are tentatively 
identified by an error-tolerant peptide mass search, described above. This method combines two 

20 consecutive stages of mass analysis to detect secondary fragment ions that are formed from a 
particular precursor ion. The first stage serves to isolate a particular ion of a particular peptide 
(polypeptide) of interest based on its m/z. The second stage is used to analyze the product ions 
formed by spontaneous or induced fragmentation of the selected ion precursor. Interpretation of 
the resulting spectrum provides limited sequence information for the peptide of interest. 

25 However, it is faster to use the masses of the observed peptide fragment ions to search an 

appropriate protein sequence database and identify the protein as described in Griffin et al, Rapid 
Commun. Mass. Spectrom. 1995, 9, 1546-51. 

Peptide fragment ions are produced primarily by breakage of the amide bonds that join 
30 adjacent amino acids. The fragmentation of peptides in mass spectrometry has been well 
described (Falick et al., J. Am Soc. Mass Spectrom. 1993, 4, 882-893; Biemann, K., Biomed. 
Environ. Mass Spectrom. 1988, 16, 99-111) 
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The following examples are provided as illustrative of the present invention and are not limiting. 



Example 1: Protein SA0005 

5 A protein from the bacterium Staphylococcus aureus, labeled SA0005, was chosen for 

use as the ligand. SA0005 was determined to have high homology to heat shock protein 33, a 
putative chaperone involved in protein folding. 

Production of SA0005 

10 A bioinformatics program (A.L. Delcher, D. Harmon, S. Kasif, O. White, and S.L. 

Salzberg. Improved microbial gene identification with GLIMMER, Nucleic Acids Research, 
1999, 27:, 4636-4641) is used to select the coding sequence of interest from the genome of & 
aureus. The coding DNA is amplified from purified genomic DNA by using PGR with primers 
that are identified with a computer program. The PCR primers are selected so as to introduce 

15 restriction enzyme cleavage sites at the ends of the DNA (e.g. Ndel and BamHl). The PCR 
product is purified by gel electrophoresis and directionally cloned into the polylinker of the 
expression vector pET15b (Novagen, WI) after the polylinker is cut with the same two restriction 
enzymes. After the ligation reaction, the DNA is transformed into E. coli bacteria that will allow 
the production of the recombinant protein in high yield. The expression vector uses a promoter 

20 for the RNA polymerase of bacteriophage T7, and the strain of E. coli is able to produce T7 RNA 
polymerase when isopropyl-{3-D-thiogalactoside (IPTG) is added to the growth medium. The 
sequence of the cloning site is such as to add polyhistidine, followed by a cleavage site for the 
enzyme thrombin, to the amino-terrninal of the recombinant heterologous protein. Bacteria 
containing the recombinant plasmid are selected for by antibiotic resistance, indicating they have 

25 acquired the plasmid, and identified either by using PCR or another method to analyze their 
DNA or by using SDS-PAGE or mass spectrometry to identify clones that produce the desired 
protein in large amounts. 

A clone that produces the desired recombinant heterologous protein in large amounts is 
30 grown in Luria broth or another mediun. IPTG is added when the culture has reached an 
appropriate cell density and then the culture is incubated overnight at 15°C, harvested by 
centrifugation at 5000 rpm for 15 minutes, and broken by sonication. The extract is clarified by 
centrifugation at 15000 rpm for 30 minutes. Nucleic acid is removed from the clarified extract 
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by passing the extract through a DE52 column in a buffer containing 500 mM NaCl. The 
recombinant protein is then bound to a nickel column and eluted with buffer containing 
imidazole. After the imidazole is removed from the preparation by dialysis, the tag is removed 
from the protein by digestion with thrombin and the mixture is passed through another nickel 
column. The recombinant heterologous protein without the polyhistidine tag flows through the 
second nickel column, now highly purified and ready for use in affinity chromatography. 

Staphylococcus aureus extract preparation : 

A Staphylococcus aureus extract is prepared from cell pellets using nuclease and 
lysostaphin digestion followed by sonication. A Staphylococcus aureus cell pellet (12g) is 
suspended in 12 ml of 20 mM Hepes pH 7.5, 150 mM NaCl, 10% glycerol, 10 mM MgS0 4 , 10 
mM CaCl 2 , 1 mM DTT, 1 mM PMSF, 1 mM benzamidine, 1000 units of lysostaphin, 0.5 mg 
RNAse A, 750 units micrococcal nuclease, and 375 units DNAse 1. The cell suspension is 
incubated at 37°C for 30 minutes, cooled to 4°C, and is made up to a final concentration of 1 
mM EDTA and 500 mM NaCl. The lysate is sonicated on ice using three bursts of 20 seconds 
each. The lysate is centrifuged at 20 000 rpm for 1 hr in a Ti70 fixed angle Beckman rotor. The 
supernatant is removed and dialyzed overnight in a 10 000 Mr dialysis membrane against ACB 
(20 mM Hepes pH 7.5, 10 % glycerol, 1 mM DTT, and 1 mM EDTA) containing 100 mM NaCl, 
ImM benzamidine, and 1 mM PMSF. The dialyzed protein extract is removed from the dialysis 
tubing and frozen in one ml aliquots at -70 °C. 

Preparation of Affinity Column 

A series of solutions of the ligand (SA0005) is prepared so as to give final amounts of 0, 
0.1, 0.5, 1 .0, and 2.0 mg of ligand per ml of resin volume. Assuming that the stock solution of 
ligand has a concentration of 3.5 mg/ml the following samples are prepared in labelled silanized 
microcentifuge tubes: 



ligand cone, on resin 
volume of resin (ul) 
Protein (ug) 
protein (ul) 
ACB buffer (ul) 



0 

100 
0 
0.0 
300 



0.1 
100 
10 
2.9 
297.1 



0.5 
100 
50 
14.5 
285.5 



100 
100 
28.9 
271.1 



2 
100 
200 
57.8 
242.2 
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A slurry of Affigel 10 is prepared andl ml of slurry is removed (enough for six 100-ul 
aliquots of resin). Using a glass frit Buchner funnel, the resin is washed sequentially with three 
10 ml portions each of ice-cold isopropanol, distilled H 2 0, and ACB containing 1 M NaCl. The 
resin is completely drained of buffer, but not dried. Into six clean silanized microcentrifuge 
tubes is added 100 mg of the Affigel 10. The buffer containing the ligand concentration series, 
as shown in the table, is added to the tubes containing Affigel 10 and mixed gently. The tubes 
containing the coupling reactions are places on a rotator at 4°C overnight. After coupling, the 
Affigel 10 resin is centrifuged at 2000 rpm for 1 minute at 4°C, or alternatively, the beads are 
allowed to settle under gravity. The beads are isolated by removing the supernatant solution 
which is saved for later analysis to evaluate the coupling efficiency. 

To the Affigel 10 is added 300 ul of ACB containing 100 niM NaCl and 80 mM 
ethanolamine. The Affigel 10 is resuspended and rotated for 2 hours at 4°C. The remaining 
reactive groups react with the ethanolamine. The Affigel 10 resin is centrifuged at 2000 rpm for 
1 minute at 4°C, or the beads are allowed to settle under gravity. The supernatant is removed 
and discarded. As an option, add 300 ul of ACB containing 100 mM NaCl and 1 mg/ml of 
bovine serum albumin, resuspend the beads, and rotate for 2 hours. The Affigel 10 resin is 
centrifuged at 2000 rpm for 1 minute at 4°C, or allowed to settle under gravity, and the 
supernatant is removed and discarded. The resin is resuspended in 300 ul of ACB containing 1 
M NaCl. This step is repeated 3 times to wash away the free bovine serum albumin from the 
resin. The supernatant is removed and the resin is resuspended with 100 ul of ACB containing 
100 mM NaCl. 

The micro-columns are prepared by using forceps to bend the ends of P200 pipette tips. 
To the pipette tips is addedlOul of glass beads and 80ul of a 50% slurry of the Affigel 10 resin 
containing the covalently attached ligand protein. The columns are allowed to drain on ice in a 
1.5 ml microcentrifuge tube and are washed with 10 column volumes (400 ul) of ACB 
containing 100 mM NaCl. 

Affinity chromatography 

Ten column volumes of the S. aureus extract is added to each micro-column and the 
flow-throughs of the columns are removed when approximately 50 - lOOul accumulates. 
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Each column is washed in the same manner with 5 column volumes of ACB containing 100 mM 
NaCl. This washing is repeated once. Each column is washed with 5 column volumes of ACB 
containing 100 mM NaCl and 0.1% Triton X-100. The columns are eluted sequentially with 4 
column volumes of ACB containing 1M NaCl and 4 column volumes of 1% sodium dodecyl 
5 sulfate into clean microcentrifuge tubes. To each eluted fraction is added one-tenth volume of 
10-fold concentrated loading buffer for SDS-PAGE. 

Resolution of the eluted proteins and detectio n of bound proteins 

The components of the eluted samples are resolved on SDS-polyacrylamide gels 
10 containing 13.8% polyacrylamide using the Laemmli buffer system. 

After the electrophoresis procedure is complete, the gel is stained in a clean glass tray. 
Using 500 ml of each rinse solution, the gel is treated sequentially with 
1) 50% methanol, 10% acetic acid overnight or for at least two hours to fix the gel. Repeat once 

1 5 for 20 minutes, 2) 20% ethanol for 1 0 minutes, 3) distilled water for 1 0 minutes, 4) sodium 
thiosulfate (0.2 g/litre) for lminute to reduce the gel, 5) water, twice for 20 seconds each wash, 
6) silver nitrate (2.0 g/litre) for 30 minutes, and 7) water for 20 seconds. The gel is washed once 
with developing solution (50 to 75 ml) for 30 seconds, and is developed to the desired intensity, 
until the band is visible (a light to dark brown). The developing solution contains sodium 

20 carbonate (30 g/litre), formaldehyde (1 .4 ml of 37% solution/litre), and sodium thiosulfate (10 
mg/litre). Once the desired stain intensity has been reached, the developing solution is removed 
quickly. The reaction is stopped by adding a 1% acetic acid solution and incubating for a 
minimum of 20 minutes. The gel is rinsed with 1% acetic acid. 

25 The gel is shown in Figure 1 . One interacting protein is apparent from the 1 % SDS eluates. 

The bands containing the interacting protein are excised with a clean scalpel. The gel 
volume is kept to a minimum by cutting as close to the band as possible. The gel slice is placed 
into a clean 0.5 ml microcentrifuge tube. To the gel slices is added 10 to 20 ul of 1% acetic acid. 
30 The sample can be stored frozen at -70°C for an extended period of time. 
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Sample Preparation for Mass Spectrometry 

The gel slices are cut into 1mm cubes and 10 to 20 jlxI of 1% acetic acid is added. The gel 
particles are washed with 100 - 150|il of HPLC grade water (5 minutes with occasional mixing), 
briefly centrifuged and the liquid is removed. Acetonitrile (-200 |il, approximately 3 to 4 times 
5 the volume of the gel particles) is added followed by incubation at room temperature for 10 to 1 5 
minutes with occasional mixing. A second acetonitrile wash may be required to completely 
shrink the gel particles. The sample is briefly centrifuged and all the liquid is removed. 

The protein in the gel particles is reduced by covering the gel slices with 100 mM 
10 ammonium bicarbonate containing 10 mM dithiothreitol and incubating at 50°C for 30 minutes. 
Briefly centrifuge and remove all the liquid. Acetonitrile is added to shrink the gel particles and 
the excess liquid is removed. The protein in the gel particles is alkylated by covering the gel 
particles with 100 mM ammonium bicarbonate containing 55 mM iodoacetamide and incubating 
for 20 minutes at room temperature in the dark. The sample is briefly centrifuged and all the 
15 liquid is removed. The gel particles are washed with 150 to 200jal of 100 mM ammonium 

bicarbonate fori 5 minutes with occasional mixing. The sample is briefly centrifuged and all the 
liquid is removed. Acetonitrile is added to shrink the gel particles and the excess liquid is 
removed. The sample is briefly centrifuged and all the liquid is removed. The gel particles are 
dried using a centrifugal vacuum concentrator for 1 minute. 

20 

To digest the interacting protein, the gel particles are rehydrated in digestion buffer 
containing trypsin (50 mM ammonium bicarbonate, 5 mM CaCl 2? and 12.5 ng/ul trypsin) on ice 
for 30 to 45 minutes (after 20 minutes incubation more trypsin solution is added). The excess 
trypsin solution is removed and 10 to 15 ptl digestion buffer without trypsin is added to ensure 
25 the gel particles remain hydrated during digestion. The samples are incubated at 37°C overnight. 

The samples are briefly centrifuged and all the liquid is transferred to a clean 
microcentrifuge tube (0.5 ml)(step 1). To the gel particles is added 100 |il of 100 mM 
ammonium bicarbonate and the peptides are extracted by shaking at 37° C in an orbital shaker for 
30 30 minutes followed by centrifugation. The liquid (step 2) is pooled with the liquid from step 1 . 
A second portion of 100 jil of 100 mM ammonium bicarbonate is added to the gel particles and 
the peptides are extracted a second time by shaking at 37 °C in an orbital shaker for 30 minutes 
followed by centrifugation. The liquid is pooled with the liquid from steps 1 and 2. 
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Purification of the tryptic peptides 

Bulk CI 8 reverse phase resin is washed several times with methanol and with 65% 
acetonitrile prior to use and a 5:1 slurry is prepared with 65% acetonitrile/1% acetic acid. Five jjtl 
of the CI 8 slurry are added to the extracted peptides and shaken for 30 minutes at 37 °C. The 
5 supernatant is removed and 150 jul of 2% acetonitrile/1% acetic acid are added and shaken for 5 
to 15 minutes at 37°C. All of the supernatant is removed and 10 tol5 \il of 65% acetonitrile / 1% 
acetic acid are added. The sample is vortexed briefly and incubated for 5 minutes with 
occasional mixing. The sample is centrifuged and the supernatant is removed to a fresh tube for 
analysis by mass spectrometry. 

10 

Mass spectrometric analysis 

Analytical samples containing tryptic peptides are subjected to Matrix Assisted Laser 
Desorption/Ionization Time Of Flight (MALDI-TOF) mass spectrometry. Samples are initially 
mixed with an equal volume of organic solvent containing a compound (matrix) that ionizes 

15 peptides upon excitation by a laser pulse. The matrix could be one of a-cyano-4-hydroxy-frara- 
cinnamic acid, sinnipinic acid, or 2,5-dihydroxybenzoic acid. The mixture of the sample and 
matrix is allowed to dry on a sample stage and introduced into the mass spectrometer. 
Specifically, 0.5 jul matrix solution containing 20 mg/ml a-cyano-4-hydroxy-/ram-cinnamic acid 
in 50% acetonitrile/1% acetic acid is mixed with 0.5 ul sample and applied to a well of a multi- 

20 sample MALDI-TOF plate. Analysis of the peptides in the mass spectrometer is carried out 
using delayed extraction and an ion reflector to ensure high resolution of peptides. The 
instrument is initially calibrated using the autohydrolysis peaks generated by trypsin, but the 
method is not dependent upon trypsin and any protease having a defined cleavage specificity 
may be used. 

25 

Tryptic peptide masses are searched against both in-house proprietary and public 
databases using a correlative mass matching algorithm. Twenty peptide masses were used in the 
search. Statistical analysis is performed upon each protein match to determine the validity of the 
match. Typical constraints include error tolerances within 0.1 Da for monoisotopic peptide 
30 masses. Cysteines are alkylated and are searched as carboxyamidomethyl modifications. 

Identified proteins are stored automatically in a relational database with software links to SDS- 
PAGE images and ligand sequences. The tryptic peptide mass spectrum is shown in Figure 2. 
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The closest protein match from the correlative search and the probability of a correct match for 
the five closest protein matches are shown in Table 1 . 



Table 1: Results of correlative database searching of 20 peptide masses. 



Rank 


Probability 


Name 


1 


1.0e+00 


EF-Tu 


2 


1.0e-17 




3 


7.7e-18 




4 


1.4e-18 




5 


2.0e-19 





One interacting protein was discovered and identified as a truncated form of EF-Tu, 
whose intact form is a key factor involved in protein biosynthesis. This form of EF-Tu is novel. 
15 It is most likely made by intracellular proteolysis from intact EF-Tu. It could be involved in 
protein synthesis in S. aureus or could have some other function. The chaperone, if it is one, 
could be involved in the folding of the EF-Tu fragment or in its assembly with some other 
protein. 

20 

Examples 2 -5 are performed using the procedures of Example 1. 



Example 2: protein SA0146 

25 A protein from the bacterium Staphylococcus aureus, labeled SA0146, was chosen for 

use as the ligand. SA0146 was found to be a homolog of the B. subtilis cell division initiation 
protein, DIV IV A, which is involved in septum formation. 



30 



27 



Table 2: Results of correlative database searching of 14 peptide masses . 



Rank 


Probability 


Name 


1 

i 


1 Oe+00 


conserved protein of unknown function 


2 


1.7e-ll 




3 


4.0e-12 




4 


8.5e-13 




5 


8.2e-13 





1 0 The interacting protein was found to be a conserved protein of unknown function. The 

data suggests that the interacting conserved protein is also involved in cell division. It could be a 
good drug target because cell division is an essential process. 

Example 3: protein SA0203 

1 5 An unknown protein from the bacterium Staphylococcus aureus, labeled S A0203 , was 

chosen for use as the ligand. The function of S A0203 is unknown. 



Table 3: Results of correlative database searching of 15 peptide masses. 



Rank 


Probability 


Name 


1 


1. Oe+00 


peptide chain release factor 3 


2 


2.4e-07 




3 


2.5e-08 




4 


l.le-08 




5 


8.1e-09 





25 

The interacting protein was found to be a homologue of peptide chain release factor 3. Its 
interaction with peptide chain release factor 3 suggests that it is involved in the termination stage 
of protein synthesis. It could potentially be a good drug target because many antibiotics inhibit 
protein synthesis. 

30 
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Example 4: protein SA0276 

A protein from the bacterium Staphylococcus aureus, labeled SA0276, was chosen for 
use as the ligand. Because of its high homology to other bacterial homologues, SA0276 was 
labeled a putative phenylalanine tRNA synthetase subunit, although only part of its sequence is a 
5 good match to enzymes of that type in other species. 

Table 4: Identification of Interactor 1, results of correlative database searching of 29 



peptide masses. 



10 



Rank 


Probability 


Name 


1 


1.0e+00 


glutamyl-tRNA Gin amidotransferase subunit B 


2 


7.7e-22 




3 


6.3e-22 




4 


5.1e-23 




5 


6.4e-24 




Table 5: Identification of Interactor 2, results of correlative database searching of: 
peptide masses. 


Rank 


Probability 


Name 


1 


1.0e+00 


glutamyl-tRNA Gin amidotransferase subunit A 


2 


1.9e-13 




3 


1.3e-14 




4 


3.4e-15 




5 


1.7e-15 





25 Two interacting proteins were discovered and identified as homologues of glutamyl- 

tRNA Gin amidotransferase subunits A and B. 

In S. aureus and perhaps other organisms, SA0276 may have an additional function in 
which it interacts with a portion of glutamyl tRNA and acts as a cofactor for glutamyl-tRNA 
30 glutamine amidotransferase. If so, that might have a vital function outside of charging 
phenylalanine tRNA, and chemicals that inhibit that activity could be good antibiotics. 
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Example 5: protein SA0526 

A protein from the bacterium Staphylococcus aureus, labeled SA0526, was chosen for 
use as the ligand. SA0526 was determined to be a homologue of EF-Ts, a protein synthesis 
elongation factor that is conserved in all bacteria. 

5 

Table 6: Results of correlative database searching of 14 peptide masses. 



Rank 


Probability 


Name 


1 


1.0e+00 


EF-Tu 


2 


4.6e-12 




3 


3.7e-12 




4 


1.6e-13 




5 


1.4e-13 





The interacting protein was found to be a homologue of EF-Tu. The interaction of EF-Tu 
1 5 with EF-Ts, which is confirmed in this experiment, has been known for more than 30 years. 

Example 6: protein SA0808 

A protein from the bacterium Staphylococcus aureus, labeled SA0808, was chosen for 
use as the ligand. SA0808 was determined to be homologous to menaquinone biosynthesis 
20 methyltransferase, an enzyme involved in the last step in the synthesis of menaquinone (vitamin 
K). 

SA0808 was prepared in a manner analogous to example 1 . 

25 S. aureus extract preparation : 

A S. aureus cell pellet (~12g) is suspended in 20 ml of lysis buffer (20 mM Hepes pH 
7.5, 500 mM NaCl, 10% glycerol, 10 mM MgS0 4 , 10 mM CaCl 2 , 1 mM DTT, ImM EDTA, 1 
mM PMSF, 1 mM benzamidine). The nucleases Rnase A (40 ug/ml final) and micrococcal 
nuclease (75 units/mL) are added. The cells are lysed with 10 pulses of 30 sec. between 90 sec. 

30 pauses using the Bead-Beater apparatus (Biospec Products Inc.). The outer chamber of the 
apparatus is filled with ice and the inner chamber with a 50/50 mixture of cells and zirconia 
beads (0.1 mm diameter). The lysate is separated from the zirconia beads using a standard 
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chromatography column and peristalic pump. The lysate is centrifuged at 20000 rpm (48000 x g) 
in Oak Ridge tubes (50 mL capacity) in a Beckman JA25.50 rotor. The extract is dialyzed 
against 1 L of 0.1 M ACB (20 mM Hepes pH 7.5, 100 mMNaCl, 10% glycerol, 10 mM MgS0 4 , 
10 mM CaCl 2 , 1 mM DTT, ImM EDTA, 1 mM PMSF, 1 mM benzamidine) overnight at 4°C in 
5 a dialysis membrane (Spectrum Labs, 10 kDa size exclusion). The extract is removed from the 
dialysis membrane and stored in 1 mL aliquots at -80 °C. 

Pre paration of Affinity Column 

A series of solutions of the ligand (SA0808) is prepared so as to give final amounts of 0, 
10 0.1, 0.5, 1 .0, and 2.0 mg of ligand per ml of resin volume. Assuming that the stock solution of 
ligand has a concentration of 3.5 mg/ml the following samples are prepared in labelled silanized 
microcentifuge tubes: 

ligand cone, on resin 
1 5 volume of resin (ul) 

Protein (ug) 
protein (ul) 
ACB buffer (pi) 

20 

A slurry of Affigel 10 is prepared andl ml of slurry is removed (enough for six 100-ul 
aliquots of resin). Using a glass frit Buchner funnel, the resin is washed sequentially with three 
10 ml portions each of ice-cold isopropanol, distilled H 2 0, and ACB containing 1 M NaCl. The 
resin is completely drained of buffer, but not dried. Into six clean silanized microcentrifuge 

25 tubes is added 1 00 mg of the Affigel 1 0. The buffer containing the ligand concentration series, 
as shown in the table, is added to the tubes containing Affigel 10 and gently mixed to suspend 
the resin. The tubes containing the coupling reactions are places on a rotator at 4°C overnight. 
After coupling, the Affigel 10 resin is centrifuged at 2000 rpm for 1 minute at 4°C. The beads 
are isolated by removing the supernatant solution. The supernatant of the 2mg/ml reaction is 

30 saved for later analysis to evaluate the coupling efficiency. 

To remove any free ligand, the resin is resuspended with 1 M ACB, centrifuged at 2000 
rpm, and the supernatant is removed. This is repeated twice more. The resin is resuspended with 
100 uL of 0.1 M ACB. 



0 


0.1 


0.5 


1 


2 


100 


100 


100 


100 


100 


0 


10 


50 


100 


200 


0.0 


2.9 


14.5 


28.9 


57.8 


300 


297.1 


285.5 


271.1 


242.2 
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The micro-columns are prepared by using forceps to bend the ends of P200 pipette tips. 
To the pipette tips is addedlOul of glass beads and 80^1 of a 50% slurry of the Affigel 10 resin 
containing the covalently attached ligand protein. The micro-columns are allowed to drain on ice 
in a 1.5 ml microcentrifuge tube. The micro-columns are adjusted to 40ul of resin (50 ul mark 
5 on tip) and are washed with 5 column volumes of ACB containing 1 OOmM NaCl. 



Affinity chromatography 

The extract is centrifuged in a microcentrifuge tube at 15000 rpm for 15 minutes at 4°C. 
1 0 The supernatant is removed to a fresh microcentrifuge tube and diluted to 5 mg protein/ml ACB 
containing lOOmMNaCl. 

Five column volumes of the S. aureus extract is added to each micro-column and the 
flow-throughs of the columns are removed when approximately 50 - lOOul accumulates. 
1 5 Each column is washed in the same manner with 5 column volumes of ACB containing 1 OOmM 
NaCl. This washing is repeated once. Each column is washed with 5 column volumes of ACB 
containing lOOmM NaCl and 0.1% Triton X-100. The columns are eluted sequentially with 4 
column volumes of 1% sodium dodecyl sulfate into clean microcentrifuge tubes. To each eluted 
fraction is added one-tenth volume of 10-fold concentrated gel loading buffer. 

20 

Resolution of the eluted proteins and detection of bou nd proteins 

The components of the eluted samples are resolved on polyacrylamide gels (no SDS is 
present in the gel, with 0.1% present in the gel running buffer) containing 13.8% 
polyacrylamide. 

25 

The gels are stained by silver staining using a mass spectrometry compatible protocol, as 
in Example 1. The gel is shown in Figure 11. 

The bands of interest are excised with a clean scalpel. The gel volume is kept to a 
30 minimum by cutting as close to the band as possible. The gel slice is placed into a clean 0.5 ml 
microcentrifuge tube. To the gel slices is added 10 to 20 pi of 1% acetic acid. The sample can 
be stored frozen at -70 °C for an extended period of time. 
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Sam ple Preparation for Mass Spectrometry 

The interacting proteins in the excised bands are digested with trypsin, and the resulting 
peptides are purified according to the procedures of Example 1. 

5 Mass spectrometric analysis 

The tryptic peptides are analyzed using MALDI-TOF mass spectrometry according to the 
procedures of Example 1. The tryptic peptide masses are searched against both in-house 
proprietary and public databases using a correlative mass matching algorithm. Statistical 
analysis is performed upon each protein match to determine the validity of the match. Typical 

1 0 constraints include error tolerances within 0. 1 Da for monoisotopic peptide masses. Cysteines 
are alkylated and are searched as carboxyamidomethyl modifications. Identified proteins are 
stored automatically in a relational database with software links to SDS-PAGE images and 
ligand sequences. The tryptic peptide mass spectra for the four interating proteins are shown in 
Figures 12a and 12b. The closest protein match from each correlative search and the probability 

15 of a correct match for the five closest protein matches are shown in Tables 7-10. 



Table 7: Identification of Interactor 1, results of correlative database searching of 27 
peptide masses. 



Rank 


Probability 


Name 


1 


1.0e+00 


elongation factor G 


2 


1.2e-28 




3 


2.4e-30 




4 


1.7e-30 




5 


1.6e-30 





30 
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Table 8: Identification of Interactor 2, results of correlative database searching of 21 
peptide masses. 



Rank 


Probability 


Name 


1 


1.0e+00 


trigger factor (prolyl isomerase) 


2 


2.9e-10 




3 


1.4e-10 




4 


7.1e-ll 




5 


5.0e-ll 




Table 9: Identification of Interactor 3, results of correlative database searching of 1 


peptide masses. 




Rank 


Probability 


Name 


1 


1.0e+00 


formate-tertrahydrofolate ligase 


2 


1.9e-07 




3 


7.3e-08 




4 


2.9e-08 




5 


1.8e-08 




Table 10: Identification of Interactor 4, results of correlative database searching oi 


peptide masses. 




Rank 


Probability 


Name 


1 


1.0e+00 


EF-Tu 


2 


1.3e-27 




3 


7.0e-28 




4 


1.0e-28 




5 


3.2e-29 
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Four interacting proteins are discovered and identified by MALDI-TOF mass 
spectrometry and correlative database searching as homologues of elongation factor G, trigger 
factor (prolyl isomerase), formate-tetrahydrofolate ligase, and EF-Tu. 

5 SA0808 is homologous to an enzyme involved in the last step in the synthesis of 

menaquinone (vitamin K). Its involvement in single carbon transfer as a methyltransferase could 
explain its interaction with formate-tetrahydrofolate ligase, an enzyme involved in one-carbon 
metabolism, but the exact connection is obscure. SA0808 also interacts with trigger factor, 
which is a prolyl isomerase. The prolyl isomerase could be involved in the proper folding of 

10 SA0808 or could have some other role in its activity. There is genetic evidence for the possible 
involvement of the homologue of SA08O8 of B. subtilis in spore germination, which involves the 
restart of a variety of metabolic processes, including protein synthesis. That could suggest that 
SA0808 has a previously unsuspected function in which it interacts with and perhaps modifies 
the protein synthesis factors EF-Tu and EF-G in order to control their activities. Interfering with 

15 this interaction could be a way to control the germination of bacteria. 

Examples 7-10 are performed using the procedures of Example 6. 

Example 7: protein SA0989 

20 A protein from the bacterium Staphylococcus aureus, labeled SA0989, is chosen for use 

as the ligand. SA0989 was determined to be homologous to 3-methyl-2-oxobutanoate 
dehydrogenase. 

Table 12: Identification of Interactor 1; Results of correlative database searching of 24 
25 peptide masses. 



Rank 


Probability 


Name 


1 


1.0e+00 


trigger factor (prolyl isomerase) 


2 


2.4e-20 




3 


1.9e-21 




4 


7.7e-22 




5 


7.7e-22 
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Table 13: Identification of Interactor 3; Results of correlative database searching of 13 
peptide masses. 



Rank 


Probability 


Name 


1 


1.0e+00 


enolase 


2 


1.4e-07 




3 


7.5e-08 




4 


6.0e-08 




5 


7.5e-09 





10 

Three interacting proteins are discovered Two are identified by MALDI-TOF mass spectrometry 
as homologoes of trigger factor (prolyl isomerase) and enolase. The third is unidentified. 

SA0989 is probably a branched chain ot-ketoacid dehydrogenase involved in the second 
1 5 step in the synthesis of branched chain amino acids. Trigger factor is a prolyl isomerase which 
could be involved in the folding of SA0989. SA0989 also interacts with another protein that has 
not yet been positively identified and with enolase. Although the interaction with enolase could 
have some significance that we do not appreciate, enolase has been found to bind to at least 20 of 
the proteins of £ aureus. Although it is possible that enolase has a chaperone-like function for 
20 many other proteins, it is also possible that enolase is a protein that interacts with many proteins 
in a fashion that is not biologically important. 

Example 8: protein SA1094 

25 A protein from the bacterium Staphylococcus aureus, labeled SA1 094, is chosen for use 

as the ligand. SA1094 is a protein of heretofore unknown function. 
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Table 14: Results of correlative database searching of 29 peptide masses. 



Rank 


Probability 


Name 


i 
i 


1 Oe+00 


putative petidase 


2 


1.7e-19 




3 


1.7e-21 




4 


3.9e-22 




5 


2.1e-22 





10 One interacting protein is discovered. The interactor is found to be a homologue of a 

putative peptidase. 

The interaction of SA1094 with a putative peptidase (based on homologues in other 
organisms) suggests that SA1094 is likely to be involved in peptide metabolism. 

15 

Example 9: protein SA1 185 

A protein from the bacterium Staphylococcus aureus, labeled SA1 185, is chosen for use 
as the ligand. SA1 1 85 is a protein of heretofore unknown function. 

20 Table 15: Identification of Interactor 1; Results of correlative database searching of 39 



peptide masses. 



Rank 


Probability 


Name 


1 


1. Oe+00 


glucose-6-phosphate isomerase 


2 


3.1e-36 




3 


2.6e-36 




4 


l.le-36 




5 


5.6e-37 
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Table 16: Identification of Interactor 2; Results of correlative database searching of 35 
peptide masses. 



Rank 


Probability 


Name 


1 


1.0e+00 


cysteine synthetase 


2 


7.9e-40 




3 


1.3e-41 




4 


5.1e-43 




5 


2.1e-43 





10 

Two interacting proteins are discovered. The identities of the interactors are determined 
by MALDI-TOF mass spectrometry as homologues of glucose-6-phosphate isomerase and 
cysteine synthetase. 

15 SA1 1 85 interacts with two enzymes of widely differing functions, glucose-6-phosphate 

isomerase involved in glucose metabolism and cysteine synthetase involved in the last step in 
cysteine biosynthesis. SA1 185 could be involved in controlling the activities or localizations of 
both enzymes. 



20 Example 10: protein SA1203 

A protein from the bacterium Staphylococcus aureus, labeled SA1203, is chosen for use 
as the ligand. SA1203 is a protein of heretofore unknown function. 



Table 12: Results of correlative database searching of 21 peptide masses. 



Rank 


Probability 


Name 


1 


1.0e+00 


NADH dehydrogenase. 


2 


1.9e-14 




3 


3.6e-16 




4 


4.2e-17 




5 


2.9e-17 
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One interacting protein is discovered. The interacting protein is a homologue of NADH 
dehydrogenase. 

SA1203's specific interaction with the respiratory enzyme NADH dehydrogenase 
suggests it could be involved in respiration and controlling the activity or membrane versus 
cytosolic location of that enzyme. 
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